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Abstract 

Given a (known) function / : [0, 1] — > (0, 1), wc consider the problem of simulating a coin 
with probability of heads f(j>) by tossing a coin with unknown heads probability p, as well 
as a fair coin, TV times each, where TV may be random. The work of Keane and O'Brien 
(1994) implies that such a simulation scheme with the probability P p (TV < oo) equal to 1 
exists iff / is continuous. Nacu and Peres (2005) proved that / is real analytic in an open 
set S C (0,1) iff such a simulation scheme exists with the probability P p (TV > n) decaying 
exponentially in n for every p £ S. We prove that for a > non-integer, / is in the space 
C Q [0, 1] if and only if a simulation scheme as above exists with P P (TV > n) < C(A„(p)) Q , 
where A n (x) := max{ y/xfY— x)/n, The key to the proof is a new result in approximation 

theory: Let B„ be the cone of univariate polynomials with nonnegativc Bernstein coefficients 
of degree n. We show that a function / : [0, 1] — > (0, 1) is in C Q [0, 1] if and only if / has a 
series representation Y^Li^n with F„ £ and J2k>n^ k ( x ) — C{A n {x)) a for all x £ [0,1] 
and n > 1. We also provide a counterexample to a theorem stated without proof by Lorentz 
(1963), who claimed that if some ip n £ satisfy \f(x) — <p n {x)\ < C(A n (x)) a for all x £ [0, 1] 
and n > 1, then / £ C Q [0,1]. 

1 Introduction 

Given a coin with unknown probability of heads p £ [0, 1], as well as a fair coin, we would like to 
simulate a coin with probability of heads f(p) where / : [0, 1] —> (0, 1) is a known function. This 
means that we are allowed to toss the original p-coin and the fair coin TV times each, where TV is 
an almost surely finite stopping time (a notion to be clarified momentarily) and declare heads or 
tails, depending on the outcome of these 2TV independent coin tosses. The probability of declaring 
a head must be exactly /(p). 
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The measure corresponding to tosses of the p-coin is the infinite product measure P p on f2 = 
{0, l} 1 ^ where in each coordinate the weights (1 — p,p) are used. A measurable function N : — > 
]N U {00} is a stopping time if for every k £ M, the indicator of N = k is a function of the first k 
coordinates in 0. We say that N is almost surely finite if the probability IPp(A < 00) is 1. More 
details on these notions can be found in any graduate textbook in Probability Theory, e.g. |10j . 

This type of problem goes back to von Neumann's article [9] where he showed how to simulate 
a fair coin (i.e., f(p) = 1/2) using only a biased p-coin where p € (0,1). Moreover, the number 

/ \ |n/2j 

of tosses N needed satisfies TP P (N > n) < ( 1 — 2e(l — e) J if p € [e, 1 — e]. In this paper we 

include a fair coin in the simulations since we want to consider p near the endpoints {0, 1} where 
simulating a fair coin using a p coin would be slow. 

Since von Neumann's article, the simulation problem was subsequently solved for various sother 
classes of functions - see [2 El E E] . In particular, it was shown in [2] that an /(p)-coin can be 
simulated using finitely many tosses of a p-coin for all p in a closed interval D C (0, 1) if and 
only if / is continuous in D. In [5], it was shown that for / : [0,1] — > (0,1), an /(p)-coin can 
be simulated using finitely many tosses of a p-coin via a finite automaton for all p £ (0, 1), if an 
only if / is a rational function over <Q. (Simulation via a finite automaton is explained in detail 
in [5]. An automaton is determined by a finite state space, a finite input alphabet, a transition 
rule from current state and input symbol to the next state, and a subset of final states. In our 
context, there are two final states, denoted and 1, and we require that when the automation is 
given independent tosses of a p-coin as input, it will reach a final state with probability one, and 
output 1 with probability f(p).) 

In [6], it was shown that if D C (0, 1) is closed and / is real-analytic in an open neighborhood 
of D, then there is a simulation of an /(p)-coin using N tosses of a p-coin where N has uniform 
exponential tails for p € D, and conversely, if a simulation with exponential tails exists for p in 
an open set S C (0,1), then / is real analytic in S. Moreover, the problem of simulation was 
recast in [6] as an approximation problem, and the question of characterizing simulation rates for 
non-analytic functions was posed. 

Definition 1. Given a simulation algorithm, its simulation rate is the probability TP p (N > n) 
that the number of required inputs exceeds n. (Each input is a toss of a p coin and a toss of a fair 
coin). If a simulation algorithm with TP p (N > n) = 0{ip n {p)) exists, we say that the function f 
can be simulated at the rate tp n {p)- 

The goal of this paper is to show that the simulation rate is determined by the smoothness of 
the simulated function /. Our main result is that for positive a ^ IN, a function / : [0, 1] — > (0, 1) 
is in the space C a (defined by a Holder condition of order a — r on the derivative of order r :=[aj) 
if and only if / can be simulated at the rate A n (p) a , where A n (x) :=max{y / x(l — x)/n, 1/n}, see 
Theorem [8] below. 

2 Preliminaries and statement of results 

We first recall relevant definitions and results from the literature on this problem and from approx- 
imation theory. Recall that the univariate Bernstein polynomials of degree n (see, e.g., [1]) are 
defined as 




k = 0, . . . , n. 



(1) 
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The Bernstein polynomials of degree n form a basis for the space II n of all polynomials of degree 
at most n. Thus, any polynomial q of degree at most n can be written as 



k=0 

with the sequence (ao, • • • ,a n ) the degree n Bernstein coefficients of q. Whenever we write 

q G B n , 

this indicates that q is already represented as a linear combination of the Bernstein polynomials of 
degree n; this is admittedly an abuse of notation since the meaning of ll q £ B n " differs from that 
of "g 6 n n " . In addition, we write 

whenever the degree n Bernstein coefficients (ao, . . . ,a n ) of q are nonnegative. We will also need 
the following partial order on the space Tl n : 

Definition 2. Given q, r S Ii n , we write q ^ n r, or r y n q, to denote that r — q £ ■ 

Result [3] below was established in [6] using a simple probabilistic construction. This result 
reduces the original simulation question to a problem in approximation theory, which we address in 
this paper. In [6] the goal was to obtain a simulation for p in a closed subset of (0, 1); in this case a 
fair coin is not needed, as it can be produced from the p coin using the von Neumann algorithm. In 
the present paper we allow p to range in the whole interval [0, 1], so we use a fair coin in addition 
to the unknown p-co'm. 

Result 3 ([6]). If there exists an algorithm that simulates a function f on a set D C [0, 1] using a 
random finite number N of tosses of a p-coin, then for all n > 1 there exist univariate polynomials 

9n(x) := £ (fj a(n, k)x k (l - x) n ~ k , h n (x) := £ ff) b(n, k)x k (l - x) n ~ k (2) 
k=0 ^ ' k=0 ^ ' 

with the following properties: 

(i) < a(n,k) < b(n,k) < 1; 

(ii) (^)a(n, k) and (fyb(n,k) are integers; 

(iii) g n (p) < f(p)< K(p); 

(iv) for all m < n we have g m -< n g n and h m ^ n h n ; 

(v) h n {p)-g n {p)=W p {N >n). 

Conversely, if there exist polynomials g n , h n as in satisfying (i) - (iv) with lim n h n {p)—g n {p) = 
for all p £ D, then there exists an algorithm that simulates an f(p)-coin for all p £ D using N 
tosses of the p-coin, where the random time N satisfies 

IP p (N > n) = h n {p)-g n (p). 
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As noted in [6], given polynomials g n , h n that satisfy all the requirements except (ii), one can always 
round the values (^)a(n,fc) down and the values (^)6(n, k) up to an integer, and the resulting 
increase in the gap h n (p) — g n (p) is exponentially small in n provided that p £ [e, 1 — e] for some 
e > 0. In the setting of the present paper, when the p-coin is tossed n times we also toss a fair coin 
n times; this means that condition (ii) above is replaced by 

(ii') (^)a(n, k) and (fyb(n, k) are integer multiples of 2~ n , 

since the probabilities of events that can be generated by tossing a fair coin n times are precisely 
the integer multiples of 2~ n . Thus, given polynomials g n ,h n that satisfy requirements (i), (hi) and 
(iv), we can round the values Qf)a(n, k) down and the values (2)b(n, k) up to the nearest multiple 
of 2 _n ; this will only add at most 2 1_n to the gap h n — g n . 

Therefore (up to an additional error term of 2 1_n ), the problem of determining the rate of 
simulation in our setting is equivalent to the problem of determining the order of two-sided approx- 
imation to /, by polynomials g n , h n G B n that satisfy requirements (i), (hi) and (iv). We will refer 
to requirements (iv) as the consistency requirements, to the approximation scheme (g n ) as a 
Bernstein-positive consistent approximation from below, and to the approximation scheme 
(h n ) as a Bernstein-positive consistent approximation from above. 

Observe that a Bernstein-positive consistent approximation to a function / from below is equiv- 
alent to a certain nonnegative series representation of /. Here is a precise statement. 

Lemma 4. Let D C [0, 1] and let (ip n ) be a nonincreasing sequence of positive functions on D 
that converges uniformly to 0. A function f is approximable on D by a sequence of Bernstein- 
nonnegative polynomials (g n ) of degree n satisfying the consistency requirement (iv) 

9rn <n 9n for all 71 > TO (3) 

and the estimate 

< f{x) - g n (x) < Mx) for all x G D (4) 
if and only if f can be represented as a series 

oo 

f(x) = F n {x) where F n {x) < i/jn(%) for all x G D, (5) 

n=0 n>N 

where each F n is a polynomial in Bernstein form of degree n with nonnegative coefficients. 

Proof. Given an approximation scheme (g n ) as above, set F n {x) := g n (x) — g n -i(x) where the sec- 
ond term g n -i(x) is rewritten in Bernstein form of degree n and where go(x) :=0. The consistency 
requirement ([3]) then guarantees that the Bernstein coefficients of F n are nonnegative, and the sum 
J2 n >N Fn(x) telescopes into f{x) — <7tv(x), which is bounded pointwise by iPn{x) according to (jl]). 

Conversely, given a series representation ([5]), let g n (x) := J2k<n Fk(x). Since the difference 
F n {x) := g n {x) — g n -\{x) is a Bernstein polynomial with nonnegative coefficients, the polynomials 
{g n ) satisfy the consistency requirement ([3]). Also, f{x) — g n {x) = ^2k>n Fk(x) < ip n (x) due to the 
rate condition ([3]). □ 

This approximation problem can be contrasted with the classical approximation of a given 
function by (unrestricted) polynomials of degree at most n on the interval [0, 1]. In that case, the 
approximation order coincides with the smoothness of /. To state this classical result precisely, we 
first recall how smoothness is measured. 
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Definition 5. Let a > with a ^ IN. A function f is said to be in the smoothness class C a [0, 1] 
if f is r :=[a\ times differentiable and the following condition holds: 
The modulus of continuity of the rth derivative f^ 

u(f^;h):= sup 

x,ye[0,l], \x-y\<h 

is of order 0{h a ~ r ). In that case, we will use the notation 

u(fV;h) 



C<* := sup ■ 

h>0 



n a-r 




(Note this is a seminorm rather than a norm, as it vanishes on polynomials of degree at most r.) 

The order of approximation of a given function by polynomials is then determined as follows. 

Result 6 (see, e.g., [TJ Chapter 8, Theorem 6.3]). Let a > be a non-integer. There exists a 
sequence of polynomials {p n }, where the degree of p n is at most n, satisfying 

\ Pn {x) - f(x)\ = 0((A n (x)) 2a ) for all x £ [0, 1] , 

if and only if f £ C a [0, 1]. Here the quantity A n (x) is defined by 

A n :=A n (x) :=max 

In other words, the rate of approximation of / £ C a [0, 1] is 0{n~ a ) away from the boundary of 
the interval [0, 1] and is 0(n~ 2a ) close to the endpoints and 1. The characterization of the rate 
of polynomial approximation for integer values a involves the generalized Zygmund class, which we 
will recall in Section [3 In the main part of this paper, we work under the assumption a ^ IN. 

Result [6] shows that a function / £ C Q [0, 1] \ C a+e [0, 1] cannot be simulated at the rate 
0(A 2 n {a+e) ). However, since our approximants must satisfy special restrictions imposed by Result [3j 
we should not expect to achieve the approximation order provided by unrestricted polynomials of 
degree n. 

In view of requirement (i), it is natural to consider first the approximation order achieved 
by polynomials with nonnegative Bernstein coefficients. G. G. Lorentz proposed a solution to 
this problem in [3], where he argued that the approximation order under this constraint is half 
the approximation order provided by unconstrained polynomials, i.e., half the smoothness of the 
function /. Theorem 1 of [3] establishes that a C°-function / can be approximated at the rate 
0((A n ) a ) by Bernstein-nonnegative polynomials of degree n. 

Result 7 ([31 Theorem 1]). Let a > 0. A positive function f £ C°[0, 1] can be approximated by 
polynomials q n of degree at most n with nonnegative Bernstein coefficients at the rate 

\q n (x) - f(x)\ = 0((A n (x)) a ) for all x £ [0, 1]. (6) 

Lorentz [3] also stated (without proof) a converse to this result; unfortunately, that converse is 
incorrect. We return to this point at the end of the section. 

We will use a variant of Lorentz' approach to establish our main result, that with the extra 
requirements (i), (iii), (iv) in place, we can still achieve the same approximation order as in ©. 
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Theorem 8. Let f : [0,1] ->■ (0,1) and let a > twft a IN. If f £ C a [0, 1], tfien / can 6e 
simulated at the rate (A n (x)) a on [0,1]. Precisely, there exist polynomials g n and h n satisfying 
conditions (i), (ii'), (Hi) and (iv) of Result^ and h n (x) — g n (x) = 0((A n (x)) Q ) uniformly in [0,1]. 
Conversely, if f can be simulated at the rate (A n (x)) a on the interval [0,1], then f £ C a [0, 1]. 

We begin by proving a reduction lemma that shows that it is enough to find consistent approx- 
imants g n , h n for each 6-adic degree n £ 6^ = {6^}^>i where 6 is a fixed integer greater than 1. 
Using these approximants, one can then interpolate between 6-adic levels to build up a consistent 
approximation scheme providing the same approximation order as the 6-adic polynomials g n , h n , 
n = b e . This 6-adic idea per se is quite well known and, in particular, is used in [6] with 6 = 2. 

Lemma 9. Let b be a fixed integer greater than 1. Given a function f on [0, 1], suppose there exist 
two sequences of polynomials (<?n) ngb iN, {h n ) ne b^ satisfying conditions (i), (Hi), (iv) of Result^ 
with tfj n of order 0((A n ) a ), so that 

K{x) - g n {x) = 0((A n (x)) a ) uniformly in [0, 1] (7) 

for n £ . Then these sequences can be augmented to full sequences (9n) n& z + ' (^™)nez + satisfying 
conditions (i), (Hi), (iv) from Result^ and condition ([?[) for all n £ Z+. In particular, under these 
assumptions there exists an algorithm that simulates an f{p)-coin at the rate 0{{A n {p)) a ) on [0, 1]. 

Proof. Given the polynomials g n and h n for fe-adic values of n, we will fill in the gaps in the two 
sequences in the obvious way: given n, let n ':=b^b n i : and set 

g n {x) :=(x + (1 - x)) n - n 'g n ,(x, y), h n (x) :=(x + (1 - x)) n ~ n> h n , (x , y) 

by expanding the right-hand sides into Bernstein polynomials of degree n. The Bernstein coefficients 
of the resulting polynomials g n , h n are therefore some convex combinations of the coefficients of 
g n >, h n i. It follows that condition (i) holds for the full sequences (g n ), (hn)- It is clear from the 
construction that (iii) and (iv) hold as well, the latter condition being an equality except when 
jumping from one 6-adic level to the next, when it is satisfied by our assumption. Recall that 
condition (ii') can always be satisfied by introducing an exponentially small correction, so there is 
no need to verify it explicitly. To check that ([?]) holds for the full sequences (g n ), (/in), note that, 
by construction, 

K(x) - g n (x) = h n ,(x) - g n >(x) = 0{(A n ,(x)) a ). 
But since n' <n < bn' , we see that 0((A n i(x)) a ) = 0((A n (x)) a ). This completes the proof. □ 

As noted already, Lorentz [3] stated a converse to Result [7] above, which (in a special case) can 
be written as follows. 

Claim 10 ([3j Theorems 5 and 6]). Let a > 0. If a function f can be approximated by polynomials 
q n of degree at most n with nonnegative Bernstein coefficients at the rate (0), then f E C a [0, 1]. 

The argument proposed in [3] for these theorems skips technical details and refers to the work 
of Timan [8]. Specifically, we quote Theorem 6 from [3] and the subsequent discussion: 
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" Theorem 6. For each r = 1, 2, . . . there is a constant C r with the following property. 
Let u(h) be a modulus of continuity, and put 

u{h) = hi — ^ du + — — du. 
Jh u Jo u 

If f(x) is a continuous function on [0, 1] and if there exists a sequence P n (x) of polyno- 
mials with positive coefficients of degree n such that 

\f(x)-P n (x)\<(A n ) r u{A n ), 0<x<l, 71 = 0,1,..., 

then f has on [0, 1] the continuous derivatives f, f", . . ., and 

u(f (r) ;h) < C r u)(h). 

We omit the proofs. The method of deriving theorems of this kind from inequalities of 
the Markov-Bernstein type is due essentially to S. Bernstein, and is well known. For the 
variation of it which fits the present situation especially well, compare [6, p. 357 and p. 
360]. It should be noted that [6, p. 357] contains an essential mistake: the derivative 
Pom+i on P- 359 should have been estimated at a point different from x. However, the 
proof can be corrected. " 

In this quote, [6] refers to the original Russian edition of Timan's work [8]. Trying to reconstruct 
Lorentz' complete argument for his Theorem 6, we came to the realization that his argument 
requires an extra assumption, in fact precisely the assumption of Bernstein-nonnegative consistent 
approximation, or equivalently, the nonnegative series representation ([5]) that is central to this 
paper. In the next section we show that, indeed, such a series representation of / with tails 
decaying at the rate (A n ) Q implies the C a smoothness of the represented function /. Thus our 
results here also provide a correction to the statement of Lorentz. In Section [61 we construct a 
counterexample to Theorem 6 from [3]. 

Our final point in this section concerns notation. In the rest of the paper, we will establish a 
number of estimates on various functions. The constants in such estimates will be usually simply 
denoted by const or, say, const the latter indicating that the constant may depend on j. A few 
constants that are crucial to our main argument will be labeled by the number of the theorem or 
lemma where they occur. 

3 Consistent approximation implies smoothness in Theorem [8] 

Lorentz proved the following analogues of Bernstein's and Markov's inequalities (both original 
inequalities can be found, e.g., in [I]). This result of Lorentz is formulated for a certain class of 
functions £1; we will use it only for the power functions 1 1— >• t J . 

Result 11 ([21 Theorem 3]). For each r = 1, 2, . . . and each H > 0, there is a constant K r = K r (H) 
with the following property. If Q(h) is an increasing positive function defined for all h > such 
that 

Q(2h) < Hn(h), h>0, 
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then for each Bernstein-positive polynomial P n of degree n, the inequality 

P n {x) < n(A n (x)), < x < 1 

implies 

\Pi r) (x)\ < K r (A n (x)r r n(A n (x)), < x < 1. (8) 
We need the following observation. 
Lemma 12. For any x and £ in [0, 1], 



. A n (x)' A n {0 J " V A n(*) 
Proof. We start by proving one of the two inequalities, viz., 



A " W <2fl + I^V (9) 



A„(f) ~ V A n (x 

By the symmetry A n (x) = A n (l — x), we may assume that x,£ G [0, 1/2]. We also assume that 
t; < x and A n (x) > 1/n, since otherwise the inequality is obvious. If ^ > x/2 then the left-hand 
side of @ is at most 2, so we may assume that £ < x/2. In this case we have 

(A n (x)) 2 < x/n < 2\x - e|/n < 2\x - £| A„(0, 

which implies ([9]). 

The proof of the other inequality (which bounds A n (£ ) / A n (x) by the right-hand side of © ) is 
very similar. We may again assume, by symmetry, that x, £ £ [0, 1/2]. We also assume that £ > x 
and A n (£) > 1/n since otherwise the inequality is obvious. Thus A n (£) < \fijn < £. If £ < 2x, 
then the left-hand side is at most 2, while the right-hand side is greater than 2. Thus, the only 
remaining case is £ > 2x. Then 

A n (0 <£<2|£-s|. 

Dividing by A n (x), we obtain the desired bound. □ 

To prove the necessity of C Q -smoothness, we will follow the approach suggested by G. Lorentz 
in [3], which goes back to Timan [8] and ultimately to S. Bernstein. 

Proof of necessity in Theorem [8l Suppose that / can be simulated at the rate (A^)^ 
on the interval [0,1]. Using the sequence (g n ) that approximates / from below and satisfies the 
consistency requirement g n -<2n 92n, we set G n := <?2 n+1 — 92 n and obtain the following nonnegative 
series representation for /: 

oo 

f(x) = Y,G n (x), G n eB+. (10) 

n=0 

By the assumption on the rate of approximation, the polynomials G n satisfy the bound 

\G n (x)\ < const (A 2 n(x)) Q for all x G [0, 1]. 
Now, the inequality ([8]) implies 

\G^(x)\ < const(A 2 n(x)) a - J ' for all x G [0, 1], j G IN. (11) 
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This already ensures that we can differentiate (jlOp term by term r times, and that 

oo 

/ (r) w = E G i r) w. ( 12 ) 

n=0 

is continuous in [0, 1]. Our goal is to prove that / € C Q [0, 1], i.e., that the inequality 

fW(x) -f {r) (y) < const \x - y\ a ~ r (13) 

holds for x,y £ [0, 1]. Without loss of generality x(l — x) > y(l — y), whence A n (x) > A n (y) for 
all n. For any n, there is some £ n between x and y such that 

\GP(x) - G%\y)\ = \x-y\ \G^ +l \Cn)\ < const \x - y\ (A^))"— 1 , (14) 

using the bound (jlip with j = r + 1. Choose TV" so that 

A 2 iv+i(x) < \x — y\ < A 2 n(x) . (15) 

For n < N we have \x — y\ < A 2 n(a;), so Lemma [T2l implies that A 2 n(x) < 4A 2 ™(£ n ). Thus for 
n < N, (fnj) gives 

\G£\x) - G«(y)| < const |x - y| (A 2 » (rr))^" 1 . (16) 
We now write /W(x) - /( r )(y) by splitting the sum (|12[) into two parts: 

V oo 

f (r) {x) _ f (r) {y) = J2(G^(x)-G^( y ))+ (G^(x)-G^(y)). (17) 

n=0 ,1=^+1 



Estimate the first sum using (jT6|) and the second using (|lip with j = r. This yields 

N oo 

< const(| 3 ;-y|^(A 2 n( a ;)r- r - 1 + 2 ]T (A 2 n(x)) Q ^) (18) 

n=0 n=7V+l 

< const |x - y|(A 2 jv (x))""^ 1 + const(A 2 iv + i (x)) a ~ r , (19) 

where we used the inequality A^{x) > v2 A 2 fc(^) to compare the two series in f)18|) to geometric 
series. In view of (fl5|) . the bound (fT9|) yields (fT3|) . □ 

In the preceding proof, the strict inequality a < r + 1 is used only at one point: to show that the 
sum of terms with n < N in (|17p is comparable to the last term. (If a = r + 1 then all these terms 
are of the same magnitude and we lose a factor of N ~ log , 1 , in the estimate.) Nevertheless, for 

6 6 \x-y\ 1 ' 

the case a = r + 1, the same method will allow us to show that f^ is in the Zygmund class. 

Theorem 13. Let r be a nonnegative integer. Suppose that f : [0, 1] — > (0, 1) can be simulated at 
the rate (A n (x)) r+1 on [0,1]. Precisely, suppose that there exist polynomials g n and h n satisfying 
conditions (i), (ii') ; (Hi) and (iv) of Result^ and h n (x) — g n {x) = 0(A 7 n +1 (x)) uniformly in [0,1]. 
Then f^ is in the Zygmund class, that is 

\f(r) {x + 6) _ 2f (r) {x) + f (r) {x _ ^ = Q(§) (2Q) 

uniformly for all x, 5 such that < x — 5 < x + 5 < 1. 



9 



In fact, as in the preceding proof, only the approximation from below by g n is used. 

Proof. The hypothesis implies that / has a series representation as in (|l(jp where the polynomials 
G n G span + {p 2 ™,fc : k = 0, ...,2 n } satisfy \G n (x)\ < const(A 2 "(x)) r+1 for all x G [0,1]. The 
inequality ([8]) implies 

\G$(x)\ < const (A 2 n(x)) r+1 ~ j for all x G [0, 1], j G IN , (21) 

so (fTD holds and /( r ) is continuous in [0,1]. Fix 5 G (0,1/2), and choose N minimal so that 
5 < A 2 n (x) . Write / = S± + S 2 where 

N oo 

Si = ^ G n (x) and 52 = ^ G n (x) . 

n=0 n =N+l 

The preceding proof works to show that S2(x + 5) — S 2 (x) = 0(5), and this implies that the estimate 
pOl) holds with } in place of /W. It remains to handle . 
For any n, there is some r\ n G [x — 5, x + <5] such that 

\G£\x + 6)-2G%\x) + G£\x-S)\ = 5 2 \G^ +2 \ Vn )\ < const 5 2 (A 2n ( Vn ))-\ (22) 

using the bound f)21 1) with j = r + 2. 

For n < iV we have 5 < A2"(x), so Lemma 1121 implies that A 2 n(x) < 4A2"(r/ n )- Thus for 
n < N, ([22]) gives 

\G$(x + 5)- 2G${x) + GW(i - <5)| < const ^(Aan^))" 1 . 

This yields 

TV 

< const (Vj^A^x))- 1 ) 

n=0 

< const 5 2 (A 2 n (x))' 1 < const 5. (23) 
The previous estimate for S 2 , together with the bound (|23p . yields (|20p . □ 



s[ r) (x + 5)-2S{ r) (x) + s{ r \x + 5) 



4 Lorentz operators and simultaneous approximation 

In the following three sections, we restrict attention to a ^ IN. We now recall the main ingredients 
of the valid proof of Result [7] (Theorem 1 from [3]). That proof is based on the Taylor expansion 

/(*) = / (-)- E ^ (~ - *Y f U) (*) + L ] (-- X ) r [/ (r) W - / W (&)]. (24) 

where := £,k(x) is a point between x and fc/ra and / is assumed to be r times differ entiable. This 
formula is used in [3] to derive an asymptotic expansion of the Bernstein operator 

(B n f)(x):=j2f(-)pnk(x), 
k=o 
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where the polynomials p nk are defined in ([T]). Multiplying the Taylor expansion (f24"|) by p n k(x) and 
summing over k, we obtain 



f(x) = (B n f)(x)-Y / -^jT nj (x)f^(x) + (R r f)(x), where (25) 

j=l J - n 

n 

T nj (x) := ^2(k-nx) 3 p nk (x), (26) 



k=0 

n 



1 / i- \ 

(i^/)(x) := -^2(--x) [f {r) (x)-f^(C k )]p nk (x). 
r ' k=o ^ n ' 

This leads Lorentz to the natural definition of the operators Q n r , using the recurrence 

(Qn,rf)(x) :=(B n f)(x) - ]T -^T nj (x)(Q n>r -jf®)(x), (27) 

3=1 

where each f^ in (I25p is replaced by its approximation Q n ,r-jf ■ 

Note that the sum in (|27p in fact starts at j=2 rather than at j=l, since the polynomial T n \ 
is identically zero. Also note that the expressions (12Tf) must be written in the Bernstein basis of 
degree n+r, so that, e.g., the leading term B n f must be multiplied by the binomial expansion of 
(x + (1 — x)) r to appear in its Bernstein form of degree n+r. We will refer to the operators Q n ,r 
mapping a function to a polynomial in Bernstein form of degree n+r as the Lorentz operators. 

An important property of the Lorentz operators that can be inferred directly from their recursive 
definition is their polynomial reproduction. Precisely, the Lorentz operator Q n ^ r reproduces 
polynomials of degree at most r. 

Lemma 14. Let f be a polynomial of degree at most r. Then Qn^f = f. 

Proof. The proof is by induction on r. The result holds for r = and 1 since Q n .o = Q n ,i is 
simply the Bernstein operator, which reproduces linear functions. For higher values of r, the proof 
is as follows. The Taylor polynomial of / of degree r coincides with /, so 



(*), 



so by multiplying by p nk {x) and summing over k, we obtain 



fix) = (B n f)(x) - £ -L Tnj (x)fV\x). 

j=\ J - H 

By our inductive assumption, /w) = Q nr _jfv). Substituting this into ()27|) . we get Q n ,rf = f ■ D 
As noted in [3], the Lorentz operators can be rewritten as follows 



k=o \ v 7 i=2 v 7 



-^jT rj (x,n) ) Pnk(x), 
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or, more simply, as 

JL (JL ,„ /h\ 1 

,n) Pnk(x), (28) 



k=0 \j=0 v J 



with the understanding that r r o(x,n) = 1 and r r i(x, n) = 0. Plugging (|28j) into (|27|) . we obtain 
(Qn,rf)(x) = ^2 if (-) ~^2f {j) (-) ^j^2j^T nl (x,n)T r ^ij^i(x,n) j p„fc(x). 

By term-by-term comparison, this yields 

j 1 

T rj (x,n) = -^-T^(:z)T r _jj_z(x,n) for j > 2. (29) 

The recurrence (|29p can be used to show that the polynomials T r j(x,n) are independent of /, are 
of degree j in x and of degree [j /2\ in ?i, as noted by Lorentz [3]. The recurrence (f29|) also shows 
that, as functions, the r rj - do not depend on the parameter r. However, in the expression for Q n>r , 
each of the T r jS is written in its Bernstein form of degree r to bring the entire expression Q n ^ r f into 
its Bernstein form of degree n + r. Since we are mainly concerned with pointwise estimates on the 
T r jS, we will use the simpler notation :=r r j. We will begin with the following estimates on the 
polynomials Tf. 

Lemma 15. The polynomials Tj are bounded by 

\Tj(x,n)\ < const,- n> (A n (x)) j for all x 6 [0, 1], (30) 
where const j depends only on j. 

Proof. We use induction on j. For j = 0, 1 ([30]) is clear. By [3l (17) on p. 242], 

\T ne {x)\ < const^ n l (A n (x)) £ . 
Applying (|29|) and the induction hypothesis 

W > 0, \Tj_ e (x,n)\ < const j-t n j ~ £ (A n (x)) j ~ e 
gives ([30]) . as required. □ 
Corollary 16. Fix an integer r > 0. For any j < r, write 

3 



Tj(x,n) =:^2ai(n, j)x l (l - x)° \ 



i=0 



Then for all i £ [0, j], we Ziaue |aj(ra,j)| = \a,j—i(n, j)\ and \ai(n,j)\ < Cjn* /or some constants C-. 
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Proof. The polynomials T n j satisfy T n j(l — x) = (— l)-?T n j(x), as is easily seen using the 
substitution k = n — k in their definition (12(31) . It then follows from the recursion (I29p that 
Tj(l — x,n) = (— l)- 7 rj(x, n) as well, and this implies that \a,i(n,j)\ = \aj-i(n,j)\ for all i. Next, 
consider the polynomial A(x) := Yli=o a i{ n i j) x% ■ Since Tj(x,n) = (1 — x) J A(j^), Lemma E] 
implies that < const.,- for x G [0, 1/n]. Thus A*{x) := ^4(^f) satisfies < constj for 

x £ [—1,1]. Markov's inequality ||oo < j 2 *| 1^*1 loo (see [U Chapter 4, Theorem 1.4]) yields 



\ ai (n,j)\ = (i!)"V W (0)| = (i!) _1 (2n)V* (-1)1 < C\n { for all i < j 



□ 



For our next argument, we will need an additional technical lemma that provides bounds on 
the derivatives of the functions p nk . 



Lemma 17. For any integer I > and any f3 > 0, 



E 

fc=0 



X 



n 



P { nl{x) < coustp )t {A n {x)f~ E for all x G [0, 1). 



(31) 



Proof. The proof is by induction on £. The proof for I = is due to Lorentz [3j Lemma 1]. Our 
proof of the bound (|3ip for £ > 1 splits into two cases. 

Case 1 . A n (x) = y/x(l — x)/n. In this case we start from the equality 

i r \ ^ ^ix i \ 

Pnk(x) = x f 1 _ x f nk(x), 

and deduce by induction on £ that the £th derivative of p nk has the form 

l (k — nx) v 



i,j,u>0 



(32) 



where ^nj v {x) are polynomials in x with coefficients that do not depend on n. For fixed i, j, v, we 
have (using that x(l — x) > A n (x) in this case) 



n % \k — nx\ v 



\z: — X\ 



; . = (A n (x))- 2 ^ r ; n . < (A n (x))-^+^ 

whence (using the already established case t = of (|3"T]) ). we obtain 



E 

fc=0 



[x(l - x)\i 



r p nk (x)<(A n (x))-^+^J2 



k=0 



X 



n 



Pnk(x) < const j3 je (A n (x)) 



P-i-j 



The restriction i+j < I implies that the right-hand side of the last display is at most const ( g i ^(A ri (x))^ 
The representation (|32p completes the proof in this case. 

Case 2 . A n (x) = 1/n. In this case we substitute a different expression for p' nk (x), precisely 

p nk (x) = npn-t fc-i(x) - npn-t k (x), 
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which yields 

This gives the bound 

n 

E 



Pnl( X ) = n Pn-l'k-l( X ) ~ n Pn-l!k( X ) ■ 



,(<-!) 



(e-i) 



k=0 



n 



k / 

< const w nj] - - x ( P?^-!^) + Pn-l?k( x ) 



fc=0 



The general inequality (a + b)P < 2^(aP + b@) implies that 



k 


— X 




k 


- 1 


n 


n 


- 1 


k 


— X 


P , 
<2^( 




k 


n 


n 


- 1 



+ 



+ 



Therefore E^J? =0 



- - x 



Pnk( X ) 



n — k 
n(n — 1) 
k 



n(n — 1) 
is at most 



< 2> : 



< 2* 



k-l 



n — 1 
k 



n — 1 



x 



+ -s) for k > 1, 



H 3- ) for < n. 



fc=i 



k-l 



n — 1 



n-1 



fc=0 



Since A n _i < A n < A n _i, we can finish the proof using the inductive assumption on I — 1. 



We now generalize Lemma [15] to derive bounds on the derivatives of the polynomials Tj . 
Lemma 18. The derivatives of the polynomials T n j and Tj are bounded as follows 

\T$(x)\ < const^ n^'(A n (x))^ 
\Tf\x,n)\ < const,-,* n J (A„(x))^ 

for all x G [0, 1]. 

Proof. Differentiate the formula (1261) £ times to obtain 



-n) 



(j - ^)! 



(33) 
(34) 



m<min{j',^} k=0 

By Lemma [T71 each term is bounded by 

n m . n j-m . const (A n (x)) (j - m) - (f - m) = const n j {A n {x)) j ~ e , 

which proves the estimate (I33p . To get the analogous estimate for derivatives of Tj = T r j, we run 
an inductive argument. Differentiating the formula (|29p i times, we get 



-f(*>»> = -E(f)E^>(*) 

m <* \ / S =2 



(35) 
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Applying the inductive assumption on the derivatives t^_^ (x, n) and the already proven bound ([33 
on TJis (x), we obtain the estimate 

const j , e n s (A n (x)) s - m ni- s (A n (x)y- s - e+m 



const, i n 3 (A n (x)y 



on each term in the sum ()35|) . and therefore on the function |r- (x, ra)| as well, proving ([3 



□ 



Next, we will show that the derivatives of the polynomials Q n ,rf approximate the corresponding 
derivatives of / sufficiently well. This is known as simultaneous approximation. Here is the 
precise result. 

Lemma 19. Let f £ C a [0, 1] and let r :=\a]—l. Then, for any j = 0, ... ,r, 

W - Qn,r)f) U) (x)\ < qjg\f\\ C «(A n (x)) a -i for all x G [0, 1], 
where the constant is independent of f and n. 

Proof. The case j = of this lemma is contained in formula (22) of [3]. To prove the result for 
all j, we use the fact the Lorentz operator Q n ,r reproduces polynomials of degree at most r. 

Our goal is to show that the jth derivative of the difference between Q n ^ r f and / at any point 
x is bounded by a constant multiple of \\f\\c^{A n {x)) a ~' J regardless of x. Since Q n , r reproduces 
polynomials of degree r (by Lemma fT^|) . we can subtract from / its Taylor polynomial of degree r 
centered at x without changing the difference (Q n ,rf — f)^\ x )- Thus, without loss of generality 
we can assume that the value of / and its derivatives up to order r are zero at x. Now, recall that 



«hrf - /> fa, M = (t (/ (;) - m) wM + 1 1 / (,) (;) I 

\k=0 ^ ^ J ' i=2 k=0 V 7 



U) 



jTi(x,n)p nk (x) 



Differentiating these sums j times, we will obtain terms of two kinds. Terms of the first kind are 
obtained from differentiating the first sum; they have the form 

(0 



k=0 

for some £ between and j. Each of these sums can be bounded as follows, using Lemma [T71 

-£ 



k=0 



f( X ) ) Pnk 



«- £ \x) 



k=0 
< const 
= const 



Qa 



k 
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X 



W{x)\ 



c<*(A n (x)) 
c«(A n (x)) a -i 



\Pnk 
(a-£)-(j 



Terms of the second kind are obtained by differentiating any of the other sums for i = 2, . . . , r and 
have the form 



k=0 



n I n 
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for some £ between and j. Taking into account that the derivatives of / up to order r vanish at 
x, each of these sums can be bounded by 



fc=o 



k\ 1 



,CM)/ 



n l n 



n 

k=0 



C a 



k 

x 

n 



-r\T^\x,n)\ p% k l \x) 



n 



Invoking the bound (|34p from Lemma [TS1 on the terms \rf'(x, n)|, we conclude that the total is 
bounded by 



1 

const \\f\\ c «-n\/\ n (x)y- l Y; 



k=o 



k 

X 

n 



\pV\-)\- 



The last sum, in turn, is estimated according to Lemma [T7l to produce the final bound 

const \\f\\ ca {/± n {x)r" • Ar H+; (i) = const (A n (x)) a ^ . 

This completes the proof. □ 
Lemma 20. Let f £ C a [0, 1] and let r :=\a] - I. Then, for all x £ [0, 1], 

I (Qn,,/) (r+1) (x)\ < q^A^r-^WfWcc , (36) 
with the constant C^2U\ independent of f and n. 



Proof. Firstly, we may assume without loss of generality that / vanishes to order r at x, since 
polynomials of degree at most r are reproduced by Q n ,r (Lemma ll4p and then annihilated by taking 
the derivative of order r + 1 , as well as by taking the rth derivative followed by a difference at two 
points x and y. The assumption made above implies that, for all i < r, 



By direct differentiation of ([28 



(Q n ,r/) (r+1) W=EE 
i=0 1=0 



r r+1 



< const / C a 



k 

x 

n 



r + 1 



fc=0 



n I n l 



(37) 



Fix i £ [0, r] and £ £ [0, r + 1]. The summand corresponding to i and ^ in ([37]) can be bounded by 



k=0 



it 

fc=0 



k 

x 

n 



^ } (x) . (38) 



Invoking Lemma [TBI we note that the terms ~\T^\x,n)\ are bounded by a constant multiple of 
(A n (x)y~ e , therefore (|38|) is bounded by 

const ||/||c4An(^)) l "'(A n (x)) Q -'- r - 1 ^ = const \\f\\ Ca (A n (x?a - r - 1. 

This proves ([36]) . □ 
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Lemma 21. Let f G C a [0, 1] and Zef r :=|~a] — 1. Then, /or any x G [0, 1], 

IIQ»,r/llo- < qsjl/llo- , 

with the constant Cfej\ independent of f and n. 

Proof. To establish the bound (I39p . we need to estimate the expression 

(Qn,rff\x)-(Q n ^f\y) 



(39) 



(40) 



for two points x and y in [0,1]. With loss of generality, we may assume that A n (x) > A n (y). 
Consider two cases. 

Case 1 . If \x — y\ > A n (x), then we estimate (|40p using the triangle inequality and the bound 



\(Qn,rf-f) (r) (x)\ < COnst(A n (x)) C 



from Lemma [T9l on each of the two terms, (Q n f — f)^ r \x) an d (Qnf — /) (y)- Altogether, this 
bounds (|40p from above by 



const \\f\\c«(&n(x)) r - a < const \\f\\ c <*\x - y\ r ~ a . 
Case 2 . If |x — y\ < A n (x), then \x — y\ < (A n (x)) r+1 ~ a \x — y\ a ~ r so for some £ between x and y, 

(Qn,rf) ir) (x)-(Qn,rf) {r) (y) 



= (Qn,rf) ir+1) (0-\x-y\ 

< const(A n (0) a " r " 1 ||/||c- • (A n (x))^ +1 - Q 
Lemma [12] implies that A n (x) < 4A n (£), and inserting this bound in (|41[) establishes ([39 



x-y\ a ~ r . (41) 



Lemma 22. Suppose that f : [0, 1] ->• IR satisfies |/ (r+1) (^)| < (A n (x))~^ /or some /3 G [0, 1] and 
aZZ x G [0, 1]. Then, for all x G [0, 1], we have 



(r+l) 
Qn,rf I yx , 



< 



(42) 



with 



r, /3) a constant independent of f and n. 



Proof. To prove (|42p . we may assume as in the preceding theorem that / vanishes to order r at 
x. This implies that for all i < r and z ^ x in [0, 1], there exists £ between x and z such that 



W ^j- < l/ (r+1) (OI < < 2(A n (x))^(l + |: 



z — X 



A n (x)J> 



(43) 



where the last step used ([9]) taken to the power /3, and the inequality \x — £| < |x — z|. 

Recall the expression ([37]) for (Q n ,rf)^ r+l \x). Fix 2 G [0, r] and £ G [0, r + 1]. The summand 



fe=0 v J 
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corresponding to i and £ in (|37p can be bounded using f|43|) and Lemma fl5l by 

r+l—i , I fe 



const 



fc=o 



k 

a; 

n 



(A n (z))-^(l + 



A n (x) 



(A„(x)) 1 



(44) 



Invoking Lemma [T71 twice, we conclude that (|44p is bounded by 

const(A n (x))-^(A n (x)) J -^(A n (x))^ i + ( A n( x )Y ^ const(An(x)) -^ 

This proves the lemma. □ 

5 The iterative construction 

The goal of this section is to prove the suffiency part of Theorem [HJ This will be achieved via 
an iterative construction of the approximants /„ that are subsequently adjusted downward and 
upward to produce the approximants g n and h n satisfying the required consistency conditions 
listed in Result [3] in the Introduction. We begin by analyzing the behaviour of the degree n + r 
Bernstein coefficients of Q n ,rf ■ 

Lemma 23. For every e > 0, there exists such that for n > uq, the degree n + r Bernstein 
coefficients of Q n ,rf ar e between min^i] / — e and maxr 0; i] / + e . 



Proof. Recall that 



Qn,rf 



n 



k=0 \j=0 



2J T j( x > n ) | PnA x ) ■ 



Note that the ith Bernstein coefficient of Tj{x^n)jv? G Bj is bounded by C| min(n _ ', n l ~i) by 
Corollary [TCI This implies that the Bernstein coefficients of /^)(^)rj(x, n)/n J for j > 1 do not 
exceed maxi<j< r ||/^'^ ||oo- Since the Bernstein coefficients of Xlfc=o fi^)Pn,k(x) are between 
rninrou / and max^j] /, we conclude that the Bernstein coefficients of Q n ,rf G B n + r are between 
miri[ 0)1 ] /- £2 ^maxi<j< r ||/ (j) ||oo and max[ 0) i] / + maxi<j< r ||/^||c») from which Lemma[23] 
follows immediately. □ 

Lemma 24. Let r:=\a] - 1. If f {j) {x) < (A n (x)) a ~ j {j = 0,...,r) for all x G [0,1], then the 
degree n + r Bernstein coefficients of Q n ,rf o,re dominated by those of 



C^x + (l-x)] r 
where O^J\ does not depend on n and f . 



E 

k=Q 



n 



Pn,k( X ) 



Proof. This lemma is a bit trickier. Separating the contributions given by different j = 0, . . . , r, 
we see that it would suffice to bound the coefficients of 



fc=o 



E K 



a-j 



Pn,k( X ) 



Tj( X ,n) 



G B 



n+j 
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by those of [X)fc=o (^n {71})° Pn,k( x )][ x + (1 — X )Y (possibly with some constant factor). Since 
both polynomials have symmetric coefficients as Bernstein polynomials in B n +r and since we may 
assume without loss of generality that n > 3r, we see that it is enough to prove that 



e (;) M3rw.y- I}£C e «))■ * 

— \ / o — I — i 1 1 \ / 



S+t=« V ' S+t=U 

0<s<u 0<s<u 
0<t<j 0<t<j 



n + r 2n 

2 - T 



Note that A n (^) is comparable to A n (^) for |s — u\ < j < r. This allows us to reduce the 
inequality to 



A£ ( - 

n 



( jmin{n *,n* j } < const ^ ("j 

S+t=U S + t=M 

0<s<u 0<s<u 
0<t<j 0<t<j 

We shall keep just one term (™) (A n on the right and use the estimate (") = ( u ™ t ) < 

3* (^)* (") valid for it < ^p, i > 0. Since t < j < r, we only need to show that 

©Wv->— {^}^(A„ey. 

But (A n > J_, which takes care of u = (with the only possible t = 0), and if 1 < n < 

^p, we have A n (^) > C^-, so it suffices to prove that min|^,^j| < or equivalently, 

min{(^-) 5 , u*"} < 1. But the first term is less than 1 for t > | and the second term is not 
greater than 1 for t < |. □ 

Iterative construction of /„ 

Let a > 0, a Z, hence r = [aj- Assume that / £ C a [0, 1] satisfies 

< min f < max f < 1 . 

[0,1] " [0,1] 

Fix no S IN and b = 2 s to be chosen later. Denote A :={b m no : m > 0} and define / n for n € A by 

/no := Qno,rf > 
fn ■= fn/b + Qn,r(/ ~ /n/b) for n > 7l . 

Our task is to show that f n — > /, that the Bernstein coefficients of f n are between 5 and 1 — 5 
for some 5 > 0, and that the Bernstein coefficients of Q n ,r{f — f n /b) are dominated (up to some 
constant factor) by those of 



0<fc<n 

We will do it in four steps. 



[x+o-x)y[ ]t UJ-\X Pnyk { X ) 
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Step 1. Estimate for fn - We will show by induction that 



l/^ r+1) l<2CfoD]||/||c«(A n r— 1 on [0,1], (45) 



provided that b is chosen large enough. By Lemma |2"0"] the inequality (|45|) holds for n = uq. Assume 
that it is true for n/b in place of n. Write 

< \f§ X) \ + \(Qn,rf n/b ) [r+1) \ + I (Qn,rf) (r+1) I ■ 

According to Lemma f20[ the last term is bounded by Cj2gj|/||c" (A n ) a ~ f ~ 1 , By the induction 
hypothesis, 

i/^i < 2c^\f\\cA^n /b ) a - r - 1 < 2b( a - r - i y 2 q^\f\\ c °(A n r- r - 1 

whence by Lemma [22] (with proper renormalization) 

\(Qn,rfn/ b ) ir+1) \ < 2q$^Wq$\f\\(»{* n )^ 1 ■ 
If b is chosen so large that 2(1 + C^^b^ a ~ r ~ 1 ^ 2 < 1, we can add these three estimates to get 

|/i r+1) |<2^o]||/||c a (A n r— !. 

Moreover, we see that 

IP " Qn,r)f n/b ] {r+1) \ < 2(1 + C^b^-^C^lfWc^nT- 1 "- 1 ■ (46) 

Step 2. An estimate for 1 1 1 1 c a • We will show that 

||/n||c Q — 2Cj^j]||/||ce», provided that b is large enough. 



Again, we will argue by induction. Lemma [21] yields the base case n = hq. Assume that the 
statement is true for n/b. Write 

fn = {J ~ Qn,r) fn/h Qn,rf • 

Since ||Qn,r/||c Q — Cj2T]ll/l|c a by Lemma [2TT it suffices to show that the C Q -norm of the function 

*:=(/- Q n ,r)fn/ b 



is bounded by C^jj|/||c«- 

We need to estimate \^^ r \x) — ^ r \y)\. Without loss of generality, we may assume that 
A n {x) > A n (y). Choose a big positive constant A and consider two cases: 

Case 1. \x-y\> AA n (x). Then 

< |*W(x)| + |*W(y)| 

< 2q^xj|/|| c «((A n (x))^ + (A n (y))°- r ) 

< 4C^i^ a - r ^\f\\c a \x-yr r 
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and we get the desired bound if Cj^gjA ( a r ) < 1. 
Case 2. \x — y\ < AA n (x). For this case, we will use the estimate 

|M/( r+1 )|<2(l + q2 2 ])6^" 1 )/ 2 ^o]||/|| CQ (A n ) 

obtained in (H6l). Write 



a—r—l 



- ^ {r) (y)\ = \y {r+1 \o\\x - y\ 

for some £ between x and y. Now, by Lemma [T2l A n (£) > [2(1 + A)]~ 1 A n (x). Combining this with 
the above estimate for we obtain 

l* (r+1) (OIN - V\ < 2(1 + C^2])& (cl - r - 1)/2 C^o]ll/ll^[2(l + A)] r+1 - a A r+l - a [AA n {x)] a - r - l \x - y\ 
< 2(1 + «[2A(l + A)r +1 - a &(— ^CbDjl/llc-N - y|' 



la— r 



and we get the desired conclusion if 

2(1 + Cfe5j)[2A(l + ^jH-i-^a-r-iya < j _ 



Step 3. An estimate for (/ — f n )v\ Since / — /„ = (I — Q n ,r)(f — fn/b) f° r n — ^ n o an d we 
know that ||/ n /f,||c Q < 2Cj^j]||/||c l «, we can invoke Lemma [191 to conclude that 

\(f-fn)®\<C^l + 2C^\\f\\ c «(A n ) a - i for n>6n . 

The same, or an even better, estimate can be derived for n = no from the representation / — f no = 
(I — Q no )f. In particular, we see that f n —tf uniformly in [0, 1]. 

Step 4. Estimates for Bernstein coefficients. It follows now from Lemma [241 and the result 
of the previous step that the degree n + r Bernstein coefficients of Q n ,r(f — fn/b) are dominated by 
those of 

q^qtgi + 2C^b a / 2 \\f\\ ca [ X + (i - x)f [£(A„r - p n ,k{x) 

(here we used the inequality (A n/b ) j < bi/ 2 (A n y < b a/2 (A n ) j for < j < r). Since the latter 
coefficients are bounded by 



a/2 

we see that the degree n + r Bernstein coefficients of f n differ from those of f no at most by the 
factor 

const ^2 n~ a l 2 < C* • n Q a ^ 2 . 

n£A,n>no 



Now, fix 5 > such that 



25 < min f < max f < 1 — 26 

[0,1] " [0,1] 



and choose no large enough so that C* • n a ^ 2 < 5 and the degree n + r coefficients of Q no f are 
between 25 and (1 — 25), which is possible by Lemma[23j Then the degree n+r Bernstein coefficients 
of f n are between 5 and 1 — 5 for all n E A such that n > no- 
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Step 5. Construction of g n and h n . Set 

( Pn{x):= — + 



n' 



x(l — x) 



11 



a/2 



where 6 a will be specified later, and define 

9n ■= fn-[x+{l- x)] r DB n ip n , h n := f n + [x + (1 - x)] r DB n ip n . 



(47) 



(48) 



The constant D here is to be chosen later. Clearly, the degree n+r Bernstein coefficients of h n are 
greater than those of g n . Also, since \<p n \ < dD^ 1 for large n, we see that the Bernstein coefficients 
of g n are positive and those of h n are less than 1 for sufficiently large n. It remains to show that 
the Bernstein coefficients of g n "increase", those of h n "decrease" and that g n — h n = 0((A n ) a ). 

Lemma 25. The functions ip n defined in J^7[ ) satisfy 

Bn^Pn < const ip n < const(A n ) a . 

Consequently, h n — g n = 0((A n ) a ). 

Proof. Since (p n is comparable to (A n ) a , it suffices to show that 



fc=0 

Recall that, by Lemma [12l 

< 2° i 1 



p n>k (x) < C(A n (x)) a 



A n I - 



la? — — I 

J re] 

A n (x) 



(A n (x)r<2' a ((A n (x)) a + 



k 

x 

n 



Now, the first term yields the sum 

n 

2 2a (A n {x)YY,PnA*) = 2 2 °(A n (x)) c 
while the second one yields the sum 



k=0 



k=0 



k 



n 



Pn,k{x) < const(A n (x)) Q , 



due to Lemma [T71 The desired bound h n — g n = 0((A n ) a ) now follows from ([38" 
Next, we want to show that 

{B n Lp n ) ^2n (1 + l)B 2 nV > 2n , 



(49) 



with some 7 > 0. To perform this comparison, we multiply the left-hand side by [x + (1 — x)] n and 
expand. We thus see that this claim is equivalent to the system of inequalities 



E 

i=o 



k (n\ I n \ 
j) Kk-j) 



n 



k 



<Pn - > (1 + 7)^2n 77- , for < k < 2n . 



2n 
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by (7k j- Note that (p n > 2 a ^ 2 (p2 n , which immediately takes care of 



Denote the coefficients — — uj u k ^. 

\ k ) 

k = and k = 2n with any 7 < 2 a / 2 — 1. So, we will assume below that 1 < k < 2ra — 1. 
Note that the function 

T(x) :=[x(l - x)] q/2 



satisfies the inequality 



T(x + i) + T(x-t) 



> T(x) 



min{x, 1 — x} 2 



(50) 



for < t < minjx, 1 — x}, provided that c a is large enough. 

Indeed, when < t < | min{x, 1—x}, (|5U|) follows from the estimate |T"(£)| < const XIeL 

valid for all £ € [x — t, x + t], and when 



min{:r,l— a;} 



J- 



— min{x, 1 — x} < t < minjx, 1 — x} , 

the inequality (|50p is trivial, provided that c a > 4. Taking into account that a^j = (Jk,k-j, 
Y,j <?k,j = 1 and that a k ,j = if | £ - ^| > min{^, 1 - ^}, we obtain 



£)>T| 


( k 




K 2n 




f k 


= T 






K 2n 




t k 


> T I 






s 2n 



c a n 



min{£;, 2n — k} 2 



2-^ ° k j \ n On 



c a k(2n — k) 



min{£;, 2n - k} 2 A(2n - 1) 
4min{/c, 2n — k} 



because one of the factors k and 2n — k equals min{A;, 2n — k} and the other one does not exceed 
2n — 1. Thus 



n" re- 



> ^ + — ttT — 



n a n a/2 



2/1 



4min(fc, 2n — k) 



We have to compare that with 



( 1 + ^ 2 «( Yn 



+ 



T| — 



(2n) a (2n) a / 2 \2n 



(1 + 7) • 



Clearly, Jg. - ^ffi >(l-^ 2 )^f>0if7<2 Q -l. Subtracting the second terms, we get 



[ — 



1 



2n n a / 2 



1 + 7 c _a 

2 q/2 4mm(k,2n-k) 



which is non- negative if 7 < 2 a ' 2 — 1 and if k or 2n — k is larger than some constant = K* (a, 7). 
But, for min(fc, 2n — k) < K*, we have T(£) < [f^] ^ 2 and, thereby, the difference is (in absolute 
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value) at most £f £ a , which is dominated by (1 j^?)^!, provided that 9 a was chosen large 

enough. This proves (09]). 

An immediate corollary is that 

{Bn/bVn/b) hn (1 + l)B n (p n 

for every n G A\{no}. Thus, the Bernstein coefficients of 

[x + (1 - x)Y(B n/b <p n/b )[x + (1 - x)f- n l b - [x + (1 - *)r^ n 

are at least as large as those of j[x + (1 — x)] r B n (p n . Since <p n > (A n )° (provided that 9 a > 1, 
of course), we see that the latter dominate the Bernstein coefficients of Q n ,r(f — fn/b) with some 
small constant. Choosing D large enough, we turn this into true domination, which finishes the 
proof of "monotonicity" of the Bernstein coefficients of g n and h n . 

6 Revisiting the claim of Lorentz 

The goal of this section is to demonstrate that Lorentz' Claim [10] made in [3] is invalid. Our 
counterexample will be constructed in several steps. We begin with some elementary observations 
about Bernstein polynomials. 

Lemma 26. Let B+[a,b) :={Y,k=o c k( x ~ a ) h ( b ~ x T~ k ■ c k > 0}- Then 

(a) B+[a,b] C B+ +1 [a,b], 

(b) B+[a,b]-B+[a,b]cBt +m [a,b], 

(c) B^[a, b] C B^[c, d] for every subinterval [c,d] of the interval [a,b], 

(d) B„[a, b] is a convex cone of functions. 
Proof. 

(a) Multiply by 1 = [(x — a) + (b — x)] and distribute. 

(b) Multiply out. 

(c) x - a = (c- a) + (x - c) G Bq[c, d] + Bf[c, d] = Bf[c, d] and 
b - x = (b - d) + (d - x) 6 B+[c, d] + B+[c, d] = B+[c, d] . 

Hence, {x - a) k {b - x) n " fe G Bf[c, d] n C B+[c, d] . 

(d) Obvious. □ 

Lemma 27. Suppose that p is a polynomial of degree n with real coefficients such that p(0) > 
and p has no roots in the unit disc {\z\ < 1}. Then p G 1, 1]. 

Proof. We have p(x) = a Ylpix — (3) H 7 (7 — x ) Y\\( x ~ ^)( x ~ ^) where f3 are negative roots, 7 
are positive roots, A are complex roots with positive imaginary parts, and a > 0. Now 

x - /3 = ( x + 1) + (-P - 1) and - (3 - 1 > . 
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Thus, x - P G Bf[-1, 1] for all /3. Similarly, 7 - x G Bf[-1, 1) for all 7. Now, 

(x — X)(x — A) = x 2 — 2Re(Xx) + |A| 2 is a convex combination of (|A| — x) 2 and (x + |A|) 2 . 

Moreover, since |A| - x G Bf[-1, 1] and |A| + x G 1] , we infer that (|A| - x) 2 G B^ [-1, 1] 

and (|A| +x) 2 G B+[-l,l] . □ 

2 

Lemma 28. The Taylor polynomial P 2 n of degree 2n of the function e~ x at has no roots in the 
disc {\z\ < 

Proof. Let \z\ < Then 

\>~* - fl*wi i E ^ * E (¥) ' £ E s E < * i^i • 

fc>n fc>n ^ ' fc>n ^ ' k>n 

and the result follows. □ 
In the sequel, we will make use of the inequality 



\e~ z2 -P 2n (z)\<e- n for \z\ < ^ 

e 

obtained in the course of the last proof. 

The following lemma is proved analytically, but the motivation of the construction is probabilis- 
tic. The Bernstein approximation B n f of a function / can be viewed as the expectation of f(X/n) 
where X is a Binomial random variable with parameters n and x. The Central limit theorem yields 
convergence of scaled Binomial variables to Gaussian variables, so the Bernstein approximation is 
close to the convolution of / with a suitable Gaussian variable. 

2 

Lemma 29. Suppose that is is a positive measure on JR such that g := v * e~ nx is bounded on the 
entire real line. Then there exists p n G £>J) 0n [— 1, 1] such that \\g — p n \\L°°[-i,i] — 3e _n ||p||oo. 

Proof. Note that x \— > P200n{\f n ~{x — t)) has no roots in the disc {\z — t\< — }. If \t\ < 2, this disc 
contains the disc {\z\ < 1}, so P2oon(V™( x ~ *)) e Bj) 0Tl [— 1, 1]. Now put 

Pn = H[-2,2] * P200n(Vn •) G B% QQn [-l, 1] . 

For all x G [—1,1], we have 

roo p — 2 

\g(x) - p n (x)\ = / e - n ^-*) 2 di/(t)+ / e- n( - x ~ t)2 du(t) 



oo 



+ J |e-«C*-*) a - P 200n (^(x -*))| di/(t)=:Ji + / 2 + J 3 . 

But | e - n ( x -*) 2 - P 2 oon(V^(^ - *))| < e" 100n as long as |je - t| < ^. So J 3 < e~ 100n ff 2 du(t). On 
the other hand, since 

^ 1 fV 7 " o . 1 



e"^ 2 dx = -= / e- x " dx > 

V™ ./-Vn V™ 
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the definition of g implies that 

IMIoc ■ 6 > J* g(x) dx > J ^ e~ nx2 dx J % dv(t) = J^= J* 
So h < e- 100n 12v^||5||oo < e- 95n ||g| 



dv{t) 



oo • 



Now, since for every y > 0, z > 1, we have e < e ny e n , we obtain 

poo /'OO 

h = / e-^*-*) 2 di/(t) = / e -"«'- 2 ) + ( 2 -^) 2 du(t) 

/>oo 

< J e~ n ■ e-^- 2 ) 2 du{t) < e- n g(2) < e^g^ , 

and, similarly, I2 < e - ™!!^!!^. Bringing these three estimates together, we arrive at the conclusion 
of the lemma. □ 

Corollary 30. Let E n :={i>*e~ nx2 }, where the measure v satisfies the assumption of Lemma\29\ If 
f : IR — > [0, 1] can be approximated by functions g n £ E n on the entire line with an error 0(n~ a / 2 ), 
then f can be approximated by p n £ 0+[— 1, 1] with an error 0{n~ a / 2 ) on [-1,1]. 

Proof. Obvious from Lemma [29l □ 



Now fix q £ (0, 1). Our next task will be to construct a function / : HI — > [0, 1] that is 
approximable by functions g n £ E wn with an error 0(n" a / 2 ) but is not in the class C Q [-l/2, 1/2]. 
Note that C -Ey whenever A < A', so it does not matter whether we consider only integer values 
or all real values of n in our statement. 

Fix h £ (0, 1) and m £ IN and define 



Recalling that the Fourier transform of the function x 1— > he nh2x2 is y 1— >■ e ny2 / h2 and using the 
Poisson summation formula 

^F(fc + x) = ^F(£)e M , 

fcez i<=z 

we get 

f _ --Kt 2 /h 2 -2nUx^Tn/h 

Jh,m — / e e 

This representation immediately implies that /fe, m attains its maximum at x = 0, and that 

< £ ^ 2/h2 < 26-^(1 + ^ e -^ 2 )<4e-^ 2 
^ez\{o} ^>3 

A,m(0) - A, m (^=) = 2 £ e"^ 2 = 2 £ e~^ 2 + • • • > 4e"^ 2 . 
^ v m / ^ odd |^| =1 

Also note that m £ £^-m- 
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Now let A denote the set {2 J : j = 2, 3, 4, . . .}. Choose h m so that 

e-tAm i 
(log 2 m) 2 m a / 2 ' 

This choice makes sense for m > mo(a) > 4. Define A' :={m £ A : m > mo(a)} and 

f ' / . 7i v)//i m ,m' 
(logo to) 2 

For every n 6 IN, let 

9n'- = / "Tj 7nfh m ,m + / Tj To £ -E^rm 

f-^ (logo to) 2 . (logo?™) 2 

m£A',m<n v oz 7 meA',m>n v ° J 7 

since <? n is a finite sum of elements of i^m with m < n plus a constant. Now, 

\f~9n\ < £ n 1 , 2 H/v,m-l||oo< £ n 1 ,2 4e "" /fe " 

f-^ (log 2 TO) 2 f-^ (log 2 TOj 2 

m6A',m>n v oz y meA',m>n K z 7 



4 £ "^ Q/2 < const n" a/2 . 



m£A',m>n 

On the other hand, for every m £ A', we have 

!odd 



V 2 V™/ \ / (log 2 m) 2 



Thus, 

4 m - Q / 2 

Ic" [-1/2,1/2] > 7 n> = 16/i m Q ->■ oo asm^oo, 



so / is not in the class C a [— 1/2, 1/2]. So, we have obtained a function / ^ C a [— 1/2, 1/2] such 
that it can be approximated by polynomials p n G 1, 1] at the rate 0(n -a / 2 ). 

Consider the function f(x) := /(x) • x(l — x) and the polynomials p n (x) :=p n (x) ■ x(l — x). The 
polynomials are in B^, 2 [— 1, 1] and the function / satisfies the condition 



I/O) -PnO)| < x(l - x) const n _a/2 < const ( J— — ] < const(A n (x)) a . 



Claim [10] is thus disproved. 



7 Further questions and remarks 

In this section we will make a few additional remarks on this and some related problems. We begin 
by discussing a conjectural characterization of simulation rates in case a is an integer. In that case, 
the classical problem of approximating a given function by polynomials of degree at most n already 
has a somewhat different solution, as we now explain. 
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Definition 31. Let a G IN. A function f is said to be in the smoothness class C a *[0, 1] if f is 
r:=a—l times differ entiable and the following condition holds: 
The symmetric modulus of continuity of /W 

h) := sup \fW(x + t)- 2/W(x) + - t)| 

t<h, a;e[t,i-t] 

zs 0/ order 0(h). In that case, we use the notation 

u*(fW,h) 

\\f\\C<** :=sup . 

h n> 

Remark 32. The class C a *[0, 1] is also known as the generalized Lipschitz class, and, for 
a = 1, as the Zygmund class we already defined in Section [5] (also see £21 Chap. 2, Sec. 9]). 

The characterization of the best polynomial approximation in case a € IN is then given by the 
following result. 

Result 33 (see, e.g., [TJ Chapter 8, Theorem 6.3]). Let a G IN. There exists a sequence of 
polynomials (p n ), where the degree of p n is at most n, satisfying 

\p n (x) - f(x)\ = 0((A n (x)) 2a ) for all x G [0, 1] 

if and only if f G C a *[0,l]. 

Motivated by this result on unrestricted polynomial approximation, we therefore conjecture a 
corresponding characterization of simulation rates. 

Conjecture 34. Let a G IN. Let f G C a *[0, 1] be a function bounded strictly between and 1. 
Then f can be simulated at the rate (A n (x)) Q on [0,1]. Precisely, there exist polynomials g n and 
f n satisfying conditions (i) - (iv) of Result^ and bound (0). 

In Theorem [13] of Section EJ we have already verified the converse: if / is simulable at the rate 
(A n (x)) a on the interval [0, 1] where a G IN, then / G C a *[0, 1]. 

Finally, we note that that for any a > 0, it is natural to ask which functions / can be simulated 
with a finite a moment, i.e., when is there a simulation algorithm for an /(p)-coin such that the 
number N of tosses of p-coins and fair coins it uses satisfies 

00 

E p {N a ) {=J2n a W p {N > n)) < 00. 

n=l 

We suspect that the precise criterion should involve the Besov smoothness of /, with proper at- 
tention to boundary effects; see, e.g., [H pp. 54-57] for the definition and basic properties of Besov 
spaces. 
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